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Abstract. The Moderate Resolution Imaging Spectrometer (MODIS) observing facility will operate on 
the Earth Observing System (EOS) in the late 1990's. It estimated that this observing facility will 
produce over 200 gigabytes of data per day requiring a storage capability of just over 300 gigabytes per 
day. Archiving, browsing, and distributing the data associated with MODIS represents a nch 
opportunity for testing and applying both lossless and lossy data compression methods. 


1. Introduction 

MODIS is a multispectral imaging system to be flown on the EOS in the late 1990's. The capability of 
the MODIS instrument derives from and expands upon some instruments that have been successfully 
flown on spacecraft or aircraft and used for many years to observe properties of the earth-atmosphere 
system and to develop data bases for studies of global change. These instruments are the Advanced 
Very High Resolution Radiometer (AVHRR) and the High Resolution Infrared Sounder (HIRS) being 
currently flown on the NOAA operational meteorological satellites, the Coastal Zone Color Scanner 
(CZCS) flown on the Nimbus-7, the Landsat Thematic Mapper, and various aircraft scanners. 

The MODIS system will be composed of two cross-track scanning instruments. One instrument is 
called MODIS-N (nadir), indicating a multispectral scanner that will not be tilted and provide a 
continuous cross-track scan. The other instrument is called MODIS-T (tilt), indicating a scanner t at 
will allow the cross-track scan to be tilted 50° fore and aft. The MODIS-N will have 36 spectral bands 
covering spectral bands in the visible, near-infrared (0.7-1 microns), the sh °rt-wave infrared ( (1- 3 
microns), and the thermal infrared (3-15 microns). Tables 1 and 2 summarize MODIS-N and Table 3 
summarizes MODIS-T. The purposes of the bands in Table 2 are only indicative and not complete 
Further details concerning MODIS-T and MODIS-N instrumentation are given in Salomonson [2] and 
Magner and Salomonson [ 1 ] . 

Data volume coming from the MODIS observing facility will be on the order of a terabit of data per day 
depending upon the total time the instruments are on and the amount of ancillary information acquired 
concerning spacecraft attitude, etc. It is quite appropriate, therefore, to consider the MODIS as a case 
study for data compression methods that would reduce the volume of data involved for archiving, 
distribution, and analysis. This paper will describe in more quantitative detail the volumes and rates of 
data associated with the MODIS. The objective of this discussion is to provide those interested in 
applying data compression methods to MODIS, as a relevant and challenging example, with particulars 
that should help in assessing the magnitude and complexities of the task. 


2. MODIS Data Volumes and Rates 

In the broadest sense, it is envisioned that data compression methods might be most appropriately 
applied to MODIS data for the purposes of data storage, data distribution, providing a browsing 
capability, or facilitating the direct broadcast of MODIS data to terminals on the ground where only a 
subset or specific parameter of the data is needed or where reduced data volume or rate is needed in 
order to be compatible with limited receiving or processing capabilities. It is assumed in this paper that 
all information for MODIS must be retained in processing, at least through level 1, if not to level 2. I he 
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reason for being resolute with regard to the level 1 data processing is that from these data are derived all 
level 2 products. The level 2 products, of which there will be as many as 100, must utilize all the 
radiometric and calibration fidelity in level 1 . Even in the storage of MODIS data, particularly in the 
case of level 1, all information must be retained. This, therefore, indicates that only lossless data 
compression methods should be applied to data storage for level 1, and perhaps level 2 and above. 
Lossy data compression methods are deemed, at this point in the author's understanding, appropriate for 
producing browse data or for very specific applications wherein the loss of information can be tolerated. 

Figure 1 depicts the overall data flow for the MODIS. This figure shows that MODIS data will flow 
from the EOS platform through the Tracking and Data Relay Satellite (TDRS) to the Customer and Data 
Operations Systems (CDOS) and into the Goddard Space Flight Center (GSFC) Distributed Active 
Archive Center (DAAC). In the DAAC, data will be processed after algorithms have been developed 
and checked for accuracy and quality by the MODIS Science Team Members using the several and 
distributed Team Member Computing Facilities (TMCF's) and the MODIS Team Leader Computing 
Facility (TLCF). When products are produced in the appropriate DAAC, they will be archived and 
distributed to the scientific community and any part of the public at large that desires to use MODIS 
data. The distribution of the data will occur through the EOS Data and Information System (EOSDIS) 
electronic network that exists when MODIS becomes operational. 

As further detail, Table 4 shows specifics concerning calculated data rates and volumes associated with 
MODIS-N and -T. In this table, it is worth noting that the 13th bit for MODIS-T data is only included to 
flag the gain used in sending MODIS-T data to the ground (see [1]). In going from level 1 A (calibration 
and navigation information provided in the header, but not applied) to level IB (calibration information 
data applied and pixel location available), the increased volume is due principally to converting the 12- 
bit information to 16 bits and adding navigation, calibration, and browse information. 

Table 5 shows the archiving requirements in gigabytes per day for MODIS-N and -T. These are rough 
estimates based on preliminary estimates using existing or planned algorithms for the principal products 
to be derived from the MODIS. In many cases these estimates are based upon experience from the 
heritage instruments indicated in the Introduction. From a similar perspective, the expected lines of 
code estimates shown in Table 6 have been derived. In general, the bulk of the effort for producing 
lines of code and storing the results falls in producing at-satellite radiances (levels 1A and IB) and in 
producing water leaving and land leaving radiances. 

Table 7 shows the estimated load on the data distribution system in providing MODIS data to other 
DAAC's in the EOSDIS. The other main DAAC, besides the GSFC DAAC, is at the EROS Data Center 
(EDC) in Sioux Falls, South Dakota. At the EDC, all level 2 products produced over land areas will be 
archived and all level 3 land products will be produced and distributed from the EDC. Other key 
DAAC's are at the Langley Research Center in Virginia and the National Snow and Ice Data Center in 
Boulder, Colorado. The assumptions as to the fraction of MODIS data that will go to these DAAC's is 
provided in the third column of Table 7. 

With regard to direct broadcast of MODIS data and the volume of MODIS browse data, the following 
statements are provided. For direct broadcast it has been assumed that 100 percent of the raw data 
would be involved, but, in this instance, on-board data compression techniques could be examined. If 
such an approach is to be used, however, that must be decided soon (i.e., in a year or 2) in order for it to 
be implemented with the instrument or on the EOS spacecraft. The situations surrounding direct 
broadcast from the EOS are relatively undefined, but one may assume that a 15 megabit direct broadcast 
link will be available to be shared among the instruments. MODIS, of course, has the potential for 
occupying a large share of this capability unless data compression is applied appropriately. In the case 
of browse data, by assuming that browse data will be comprised of 5 percent of levels IB, 2, and 3, 
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END-TO-END MODIS DATA FLOW FROM INSTRUMENT TO USERS 
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resulting in an estimate of about 12 gigabytes of data per day. Browse data is a prime candidate for 
applying lossy compression methods. 


In the case of archiving MODIS data, it has already been indicated that no information should be lost in 
archiving the data. However, lossless compression methods could, and perhaps should, be applied that 
a the progressive extraction of archived data at various levels of accuracy depending upon the 
amount of information actually needed. This means that if the data are compressed appropriately, one 
could access the archive and extract first-order information. If this initial extraction indicates further 
information is needed, another pass through the compressed archive could result in higher-order 
information. Ultimately it appears that data compression methods are available for archived data 
wherein the complete information available in the original data stream ultimately could be retrieved. 


3, Summary and Conclusions 

The MODIS provides a rich opportunity for applying data compression methods for archiving, 
browsing, and distribution. Lossless methods should be developed for archiving that allow eventual 
extraction of all the information contained in the MODIS. Lossy methods can very appropriately be 
applied in order to browse MODIS data and distribute it for quick-look analyses. The challenges 
include costs of developing and applying data compression methods including associated hardware 
-i* e availability of off-the-shelf versus special purpose hardware and software, demonstrating 
reliability and low risk of losing information for lossless methods plus, in many cases, making the 
application of data compression transparent to the average user. 
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TABLE 1 


MODIS-Nadir (N) Summary 


PARAMETERS 


DESIGN SPECIFICATION S OR 
EXPECTED PERFORMANCE 


PLATFORM ALTITUDE 
IFOV (no. of bands @ IFOV) 


SWATH 

SPECTRAL BANDS 
RADIOMETRIC ACCURACY 


QUANTIZATION 

POLARIZATION SENSITIVITY 

MODULATION TRANSFER FUNCTION 

S/N PERFORMANCE 

(70 DEGREE SOLAR ZENITH/OCEANS) 

NEDT PERFORMANCE (THERMAL BANDS) 
@ 300 DEG K/WINDOW BANDS 

SCAN EFFICIENCY 

INTEGRATION TIME 

SIZE (APPROX) 

WEIGHT 

POWER 

PEAK DATA RATE 
DUTY CYCLE 


705 KM 

29 @ 1000 M 
5 @ 500 M 
2 @ 250 M 

110 DEG/2330 KM 

36 BANDS TOTAL 
(19/0.4-3.0 pm; 17/3-15 urn) 

5% ABSOLUTE, < 3 urn 
1% ABSOLUTE, > 3 urn 
(possibly < 0.3%) 

2% REFLECTANCE 

12 BIT 

2% MAX, < 2.2 pm 

0.3 AT NYQUIST 

830:1 (443 nm) 

745:1 (520 nm) 

503:1 (865 nm) 

LESS THAN 0.05 

(TO BE DETERMINED) 

(TO BE DETERMINED) 

I X 1.6 X 1 M 
APPROX 200 kg 
250 w 

I I MBS (daytime) 

100 % 
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TABLE 2 


MODIS-N Bands 

BAND CENTER * IFOV (ml WIDTH PURPOSE 

LAND AND CLOUD BOUNDARIES BANDS 


1 

659 

250 50 

VEG CHLOROPHYLL ABS 





LAND COVER TRANS. 

2 

865 

250 

40 

CLOUD AND VEGETATION 





LAND COVER TRANSF. 


LAND AND CLOUD PROPERTIES BANDS 


3 

470 

500 

20 

SOIL, VEG DIFFRNCS 

4 

555 

500 

20 

GREEN VEGETATION 

5 

1240 

500 

20 

LEAF/CANOPY PROPRTIES 

6 

1640 

500 

20 

SNOW/CLOUD DIFFRNCES 

7 

2130 

500 

50 

LAND & CLOUD PROPRTIES 


OCEAN COLOR BANDS 



8 

415 

1000 

15 

CHLOROPHYLL 

9 

443 

1000 

10 

CHLOROPHYLL 

10 

490 

1000 

10 

CHLOROPHYLL 

11 

531 

1000 

10 

CHLOROPHYLL 

12 

565 

1000 

10 

SEDIMENTS 

13 

653 

1000 

15 

SEDIMENTS, ATMOSPHERE 

14 

681 

1000 

10 

CHLOR. FLUORESCENCE 

15 

750 

1000 

10 

AEROSOL PROPERTIES 

16 

865 

1000 

15 

aerosoivatm prprts 


ATMOSPHERE/CLOUD BANDS 



17 

905 

1000 

30 

CLOUD/ATM PROPERTIES 

18 

936 

1000 

10 

CLOUD/ATM PROPERTIES 

19 

940 

1000 

50 

CLOUD/ATM PROPERTIES 


THERMAL BANDS 



20 

3.75 

1000 

0.18 

SEA SURFACE TEMP 

21 

3.75 

1000 

0.05 

FOREST FIRES/VOLCANOES 

22 

3.96 

1000 

0.05 

CLOUD/S FC TEMPERATURE 

23 

4.05 

1000 

0.05 

CLOUD/S FC TEMPERATURE 

24 

4.47 

1000 

0.05 

TROP TEMP/CLD FRACTION 

25 

4.52 

1000 

0.05 

TROP TEMP/CLD FRACTION 

26 

4.57 

1000 

0.05 

TROP TEMP/CLD FRACTION 

27 

6.72 

1000 

0.36 

MID-TROP HUMIDITY 

28 

7.33 

1000 

0.30 

UPPER-TROP HUMIDITY 

29 

8.55 

1000 

0.30 

SFC TEMPERATURE 

30 

9.73 

1000 

0.30 

TOTAL OZONE 

31 

11.03 

1000 

0.50 

CLOUD/SFC TEMPERATURE 

32 

12.02 

1000 

0.50 

CLOUD/SFC TEMPERATURE 

33 

13.34 

1000 

0.30 

CLD HEIGHT & FRACTION 

34 

13.64 

1000 

0.30 

CLD HEIGHT & FRACTION 

35 

13.94 

1000 

0.30 

CLD HEIGHT & FRACTION 

36 

14.24 

1000 

0.30 

CLD HEIGHT & FRACTION 


* BAND CENTER AND BANDWIDTH ARE IN NANOMETERS FOR BANDS 1-19 AND 
MICROMETERS FOR BANDS 20-36 
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TABLE 3 


MODIS-Tilt (T) Summary 


PARAMETERS 


PLATFORM ALTITUDE 

IFOV 

SWATH 

SPECTRAL BANDS (10-15 nm WIDTH) 
DYNAMIC RANGE 
RADIOMETRIC ACCURACY 


DESIGN 

SPECIFICATIONS 
OR EXPECTED 
PERFORMANCE 

705 KM 

1.4 MRAD (1.1 KM) 

90 DEG/1500 KM 

32 (400-880 nm.) 
(AREA ARRAY) 

Lmax 95% @ 22.5 deg 
solar zenith angle 


5% absolute 
2% relative to the sun 


QUANTIZATION 12 BIT 

POLARIZATION SENSITIVITY <2.3% 

(< 20 deg tilt) 

MODULATION TRANSFER FUNCTION 0.3 AT NYQUIST 

S/N PERFORMANCE (SPEC) 835:1 (440 nm) 

(70 DEGREE SOLAR ZENITH) 685: 1 (625 nm) 

400:1(845 nm) 


NEDT PERFORMANCE (THERMAL BANDS) N/A 


SCAN EFFICIENCY 

INTEGRATION TIME 
MODE) 

COLLECTING APERTURE (DIA) 
SIZE (APPROX) 

WEIGHT 

POWER 

PEAK DATA RATE 
DUTY CYCLE 


25% 

1.127 MSEC (COMPOSITE 
34 MM 

75 X 140 X 100 cm 
-170 kg 
-130 w 
-3 mbps (day) 
DAYTIME/100% 
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TABLE 4 


MODIS-N and MODIS-T Data Rate and Volume Estimates 


Earth Radius (km) 6371 

Satellite Altitude (km) 705 

Orbital Period (min) 98.9 

Modis-N # 1000 m REF channels 12 

Modis-N # 500 m REF channels 3 

Modis-N # 250 m REF channels 2 

Modis-N # 1000 m TIR channels 17 

Modis-N # 500 m NIR channels (1.6, 2.1 nm) 2 

Modis-T # 1.1 km REF channels 32 

MODIS-N # bit^REF channel 12 

MODIS-N # bits/TIR channel 12 

MODIS-T # bits/REF channel 13 

MODIS-N REF Duty Cycle 50% 

MODIS-N TIR Duty Cycle 100% 

MODIS-T REF Duty Cycle 45% 

MODIS-N # Along-track EFOVs f 

MODIS-T # Along-track EFOVs 30 

MODIS-N # Detectors 648 

MODIS-T # Along-track detectors 30 

MODIS-N # Maximum scan angle (deg) 55 

MODIS-T # Maximum scan angle (deg) 45 

MODIS-N # IFOV FWHM (deg) 8.13E-02 

MODIS-T # IFOV FWHM (deg) 8.94E-02 

MODIS-N # pixels along-scan/on-Earth 1354 

MODIS-T # pixels along-scan/on-Earth 1007 

MODIS-N Scan Period (sec) 1 .2 

MODIS-T Scan Period (sec) 4.6 

MODIS-N VIS Data (megabits/scan) 7.3 

MODIS -N TIR Data (megabits/scan) 3 . 2 

MODIS-N Daytime Data (megabits/scan) 10.5 

MODIS-T Daytime Data (megabits/scan) 12.6 

MODIS-N # Scans/Orbit 5000 

MODIS-T # Scans/Orbit 579 

MODIS-N Daytime Data Rate (mbps) 8.9 

MODIS-N Nighttime Data Rate (mbps) 2.7 

MODIS-T Daytime Data Rate (mbps) 2.7 

MODIS-N Orbital Ave Data Rate (mbps) 5.8 

MODIS-T Orbital Ave Data Rate (mbps) 1 .2 

MODIS-N Daily Data Volume (gigabytes) 62.6 

MODIS-T Daily Data Volume (gigabytes) 13.1 

Total Daily Data Volume (gigabytes) 75.8 

MODIS-NVolume (gigabytes) Level 1 A ~ 65.8 

MODIS-T Volume (gigabytes) Level-IA 1 3.8 

Total Daily Volume (gigabytes) @1 A 79.6 

MODIS-N Volume (gigabytes) Level-IB 1 1 3.6 

MODIS-T Volume (gigabytes) Level- IB 23. 1 

Total Daily Volume (gigabytes) @ IB 136.7 

Total Daily Volume (gigabytes) @1A& IB 216.3 
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TABLE 5 


MODIS Long-Term Archive Storage Requirements 
(Gigabytes Per Day) 

PRODUCT LEVEL 


DATA PRODUCT 


1A IB 2/T 2/N 3 TOTAL 


Navigation 

Calibration 

Spacecraft Ancillary 

At-Satellite Radiances 

Water-Leaving Radiances 

Single Scattering Aerosol Radiances 

Angstrom Exponents 

Chlorophyll-A Concentrations (Case 1) 

Chlorophyll-A Concentrations (Case 2) 

Chlorophyll-A Fluorescence 

CZCS Pigment Concentrations 

Sea-Surface Temperature 

Sea-Ice Cover 

Attenuation at 490 nm 

Detached Coccolith Concentration 

Phycoerythrin Concentrations 

Dissolved Organic Matter 

Suspended Solids 

Glint Field 

IPAR 

Ocean Cal Data Sets 
Primary Production (Oceans) 
Land-Leaving Radiances 
Topographically Corrected Radiance 
Vegetation Index 
Polarized Vegetation Index 
Land Surface Temperature 
Thermal Anomalies 
Evapo transpiration 
Primary Production (Land) 

Snow Cover 

Spacial Heterogeneity (not sized here) 

Land Cover Type 

Bidirectional Reflectance, BRDF 

Cloud Mask 

Cloud Fraction 

Cloud Effective Emissivity 

Cloud-Top Temperature and Pressure 

Cloud Optical Thickness (0.66 fm) 

Cloud Particle Effective Radius 

Cloud Particle Thermodynamic Phase 

Aerosol Optical Depth (0.41 to 2.13fm) 

Aerosol Size Distribution 

Aerosol Mass Loading 

Atmospheric Stability 

Total Precipitable Water 

Total Ozone 

Browse 

Metadata (Not sized here) 

Ocean Discipline Subtotal (L-2/3) 

Land Discipline Subtotal (L-2/3) 
Atmosphere Discipline Subtotal (L-2/3) 
Total 


18.7 18.7 

6.8 6.8 
4.3 4.3 

75.3 104.7 180.0 

10.1 4.0 6.6 20.7 

8.2 2.6 10.9 

0.3 0.4 0.8 

0.3 0.4 0.2 1.0 

0.0 0.0 0.2 0.3 

0.3 0.4 0.2 1.0 

0.3 0.4 0.2 1.0 

1.1 0.2 1.3 

0.1 0.1 0.2 

0.3 0.4 0.2 1.0 

0.1 0.1 0.2 0.5 

0.3 0.2 0.5 

0.3 0.4 0.2 1.0 

0.3 0.4 0.2 1.0 

0.3 0.4 0.2 1.0 

0.1 0.1 0.1 0.2 

0 

0.3 0.4 0.2 1.0 

2.7 14.4 2.8 20.0 

2.7 14.4 2.8 20.0 

3.8 1.7 5.5 

3.8 1.7 5.5 

0.5 0.2 0.6 

0.5 0.5 

0.1 0.1 

0.1 0.1 

0.2 0.0 0.2 

0 

0.0 0.0 

0.0 0.0 

0.3 1.6 1.9 

0.0 0.0 

0.1 0.0 0.1 

0.3 0.0 0.3 

0.1 0.0 0.1 

0.1 0.0 0.1 

0.0 0.0 0.0 

0.0 0.0 

0.0 0.0 

0.0 0.0 

0.1 0.0 0.1 

1.7 0.0 1.7 

0.1 0.0 0.1 

6.5 1.4 2.7 1.0 5.0 

0 

21.8 12.1 9.5 43.4 

5.4 37.6 9.5 52.5 

0.3 4.0 0.0 4.3 

79.6 136.7 28.9 56.4 20.0 315.0 
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TABLE 6 


Estimated MODIS Data Processing Requirements 
(Lines of Code) 



LAUNCH LOC 


Level- 1 A 

25,000 

25,000 

Level- IB 

25,000 

30,000 

Calibration/Monitor 

72,000 

144,000 

Level-2 Ocean 

12,000 

24,000 

Level-2 Land 

40,000 

80,000 

Level-2 Atmosphere 

20,000 

40,000 

Level-2 Shell 

30,000 

30,000 

Level-2 Utility 

40,000 

80,000 

Level-2 IDS Products 

36,000 

72,000 

Level- 3 

30,000 

60,000 

Near-Real-Time 

17,800 

81,500 

Subtotal 

347,800 

666,500 

Supporting Software 
(validation) 


552,000 

Total 


1.218.500 
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TABLE 7 


MODIS Data Distribution 
(Gigabytes Per Day) 



TO 



CDOS 

1 V 

GSFC 

All Level-0 Products 

76 

GSFC 

MODIS Investigators 

10% of Level- 1 A Products 
50% of Level- IB Products 
100% of Level-2 Products 
100% of Level-3 Products 

182 

GSFC 

Other Investigators 

5% of Level-IB Products 
10% of Level-2 Products 
10% of Level-3 Products 

17 

GSFC 

EDC 

Level- IB for Land Products 

41 

GSFC 

Langley Research Center 

100% of Level- IB Products 

137 

GSFC 

National Snow and 

Level- IB for Snow and 



Ice Data Center (NSIDC) 

Ice Products 

4 
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